Effective scoring function for protein sequence design.

نویسندگان

  • Shide Liang
  • Nick V Grishin
چکیده

We have developed an effective scoring function for protein design. The atomic solvation parameters, together with the weights of energy terms, were optimized so that residues corresponding to the native sequence were predicted with low energy in the training set of 28 protein structures. The solvation energy of non-hydrogen-bonded hydrophilic atoms was considered separately and expressed in a nonlinear way. As a result, our scoring function predicted native residues as the most favorable in 59% of the total positions in 28 proteins. We then tested the scoring function by comparing the predicted stability changes for 103 T4 lysozyme mutants with the experimental values. The correlation coefficients were 0.77 for surface mutations and 0.71 for all mutations. Finally, the scoring function combined with Monte Carlo simulation was used to predict favorable sequences on a fixed backbone. The designed sequences were similar to the natural sequences of the family to which the template structure belonged. The profile of the designed sequences was helpful for identification of remote homologues of the native sequence.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Developing optimal non-linear scoring function for protein design

UNLABELLED Motivation. Protein design aims to identify sequences compatible with a given protein fold but incompatible to any alternative folds. To select the correct sequences and to guide the search process, a design scoring function is critically important. Such a scoring function should be able to characterize the global fitness landscape of many proteins simultaneously. RESULTS To find o...

متن کامل

A conditional neural fields model for protein threading

MOTIVATION Alignment errors are still the main bottleneck for current template-based protein modeling (TM) methods, including protein threading and homology modeling, especially when the sequence identity between two proteins under consideration is low (<30%). RESULTS We present a novel protein threading method, CNFpred, which achieves much more accurate sequence-template alignment by employi...

متن کامل

On Simplified Global Nonlinear Function for Fitness Landscape: A Case Study of Inverse Protein Folding

The construction of fitness landscape has broad implication in understanding molecular evolution, cellular epigenetic state, and protein structures. We studied the problem of constructing fitness landscape of inverse protein folding or protein design, with the aim to generate amino acid sequences that would fold into an a priori determined structural fold which would enable engineering novel or...

متن کامل

Scoring functions for computational algorithms applicable to the design of spiked oligonucleotides.

Protein engineering by inserting stretches of random DNA sequences into target genes in combination with adequate screening or selection methods is a versatile technique to elucidate and improve protein functions. Established compounds for generating semi-random DNA sequences are spiked oligonucleotides which are synthesised by interspersing wild type (wt) nucleotides of the target sequence wit...

متن کامل

Prediction of functional engrailed homology-1 protein motif from sequence

Prediction of functional peptide motifs from sequences is an important problem in bioinformatics. Experimental discovery of functional sequences is laborious. Searches for specific motifs within the increasing number of proteins available in public databases often involve extensive computer calculations. Short peptide motifs are especially hard to identify via currently available methods. Prese...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Proteins

دوره 54 2  شماره 

صفحات  -

تاریخ انتشار 2004